Combining lexicon-driven parsing and phrase-structure-based parsing
Abstract
… GB-Grammar), which do not have explicit phrase structure rules, are suitable for higher-level syntax but not for low-level language, which seems to require the power of phrase structure rules. This paper presents an implementation method to combine lexicon-driven parsing and phrase-structure parsing, with a special means called the graph-structured stack.
Most linguistic grammar formalisms can be classified into two families: phrase-structure-based or lexicon-driven. The phrase-structure-based formalisms include Definite Clause Grammar [?], Lexical Functional Grammar [3], Generalized Phrase Structure Grammar [5, 6] and Functional Unification Grammar [7]. They are all based on context-free phrase structure rules which are augmented in one way or another. On the other hand, lexicon-driven formalisms include Categorial Grammar [1], Head-driven Phrase Structure Grammar [9, 10] and GB-Grammar [4, 13]. None of these lexicon-driven formalisms has explicit phrase structure rules; instead, information about how to combine constituents into a higher constituent is encoded in each lexical entry or constituent that is to be combined. It is often argued that lexicon-driven formalisms can handle some linguistic phenomena, such as free word order, more elegantly, and also that they can capture the universality of multiple languages, as described, for example, by Wehrli [14].
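To make the contrast between the two families concrete, here is a minimal Python sketch. It is not taken from the paper: the toy lexicons, the category encoding, and the names `ps_combine`, `cg_combine` and `shift_reduce` are all assumptions made for illustration. The phrase-structure version keeps its combination knowledge in an explicit rule table, while the Categorial-Grammar-style version keeps it inside the lexical categories themselves. A greedy, deterministic shift-reduce loop is used only as shared scaffolding and does not handle ambiguity.

```python
# Phrase-structure-based: combination knowledge lives in an explicit rule table.
PS_LEXICON = {"the": "Det", "dog": "N", "cat": "N", "chased": "V"}
PS_RULES = {("Det", "N"): "NP", ("V", "NP"): "VP", ("NP", "VP"): "S"}


def ps_combine(left, right):
    """Return the parent category if an explicit rule covers (left, right)."""
    return PS_RULES.get((left, right))


# Lexicon-driven (Categorial-Grammar style): there is no rule table at all.
# A complex category is a tuple (result, slash, argument); "/" looks for its
# argument to the right, "\\" looks for it to the left. (Hypothetical encoding.)
TV = (("S", "\\", "NP"), "/", "NP")  # transitive verb: (S\NP)/NP
CG_LEXICON = {"the": ("NP", "/", "N"), "dog": "N", "cat": "N", "chased": TV}


def cg_combine(left, right):
    """Forward or backward application, driven only by the categories."""
    if isinstance(left, tuple) and left[1] == "/" and left[2] == right:
        return left[0]
    if isinstance(right, tuple) and right[1] == "\\" and right[2] == left:
        return right[0]
    return None


def shift_reduce(words, lexicon, combine):
    """Greedy shift-reduce: shift one word, then reduce while the top two
    stack items can be combined."""
    stack = []
    for word in words:
        stack.append(lexicon[word])  # shift
        while len(stack) >= 2:
            merged = combine(stack[-2], stack[-1])
            if merged is None:
                break
            stack[-2:] = [merged]  # reduce
    return stack


sentence = "the dog chased the cat".split()
print(shift_reduce(sentence, PS_LEXICON, ps_combine))
print(shift_reduce(sentence, CG_LEXICON, cg_combine))
```

Both calls run the same loop; only the place where the combination knowledge is stored differs.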
In section 2, on the other hand, it is argued that, while lexicon-driven formalisms might be suitable to cope with linguistically "interesting" phenomena, they lack the power to express linguistically "uninteresting" phenomena, which are often very specific to a particular language. Unfortunately, in order to build a practical parser for a "real" text, one must handle very many "uninteresting" low-level phenomena which cannot be easily formalized in the lexicon-driven way. Hence, if we want to build a practical parser with a theory behind the lexicon-driven formalisms, then it is essential to combine lexicon-driven parsing (for higher-level syntax) and phrase-structure-based parsing (for low-level detail and other language-specific/exceptional constructions). In section 3, we generalize the computational models of all the lexicon-driven and phrase-structure-based formalisms as shift-reduce parsing. Section 4 introduces the graph-structured stack to handle non-determinism in shift-reduce parsing. In sections 5 and 6, we describe the use of the graph-structured stack in lexicon-driven and phrase-structure-based parsing, respectively. We then discuss how to combine these two kinds of parsing with the graph-structured stack, in section 7.
In practical applications, input sentences have many language-specific phenomena that are not linguistically "interesting". Consider the following sentences: "Take two …
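The idea of a graph-structured stack for non-deterministic shift-reduce parsing can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's implementation: it is a recognizer only (no parse forest), it uses binary rules without an LR table, and the names `Node`, `recognize`, `LEXICON` and `RULES` as well as the toy grammar are hypothetical. Stack tops with the same category are merged into one vertex, and each vertex records every vertex that may lie directly beneath it, so alternative stacks share structure instead of being copied.

```python
class Node:
    """One vertex of the graph-structured stack: a category plus links to
    every vertex that may lie directly beneath it."""

    def __init__(self, label):
        self.label = label
        self.preds = set()


def recognize(words, lexicon, rules, start="S"):
    """Return True if some sequence of shifts and reductions covers all words
    with a single `start` category. Binary rules only (assumption)."""
    bottom = Node("$")
    tops = {"$": bottom}  # current stack tops, one merged vertex per category
    for word in words:
        new_tops = {}
        agenda = []  # stack edges (below, above) still to be tried for reduction

        def push(label, below):
            # Merge: reuse the vertex for `label` if this frontier already has one.
            node = new_tops.setdefault(label, Node(label))
            for pred in below:
                if pred not in node.preds:
                    node.preds.add(pred)
                    agenda.append((pred, node))

        # Shift: each lexical category goes on top of all current tops at once.
        for cat in lexicon.get(word, ()):
            push(cat, list(tops.values()))

        # Reduce: try every new edge; the parent sits on whatever was below `left`.
        while agenda:
            left, right = agenda.pop()
            for parent in rules.get((left.label, right.label), ()):
                push(parent, list(left.preds))

        tops = new_tops
    return start in tops and bottom in tops[start].preds


# Toy ambiguous grammar (prepositional-phrase attachment), illustrative only.
LEXICON = {"I": ["NP"], "saw": ["V"], "a": ["Det"], "man": ["N"],
           "with": ["P"], "telescope": ["N"]}
RULES = {("NP", "VP"): ["S"], ("V", "NP"): ["VP"], ("Det", "N"): ["NP"],
         ("NP", "PP"): ["NP"], ("VP", "PP"): ["VP"], ("P", "NP"): ["PP"]}

print(recognize("I saw a man with a telescope".split(), LEXICON, RULES))
```

In this sketch the prepositional phrase attaches both to the noun phrase and to the verb phrase, and the two analyses end up sharing the same `NP` and `VP` vertices rather than living on two separate stacks.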
Similar resources
Feature Engineering in Persian Dependency Parser
A dependency parser is one of the most important fundamental tools in natural language processing; it extracts the structure of sentences and determines the relations between words based on dependency grammar. Dependency parsing is well suited to free-word-order languages such as Persian. In this paper, a data-driven dependency parser has been developed with the help of phrase-structure parser fo...
A Data-Oriented Parsing Model for HPSG
Data Oriented Parsing (DOP) is based on the idea of processing new input by combining fragments (associated with some probabilities) that are extracted from a treebank. In the simplest case these fragments are subparts of simple phrase structure trees (Tree-DOP). The approach is attractive in many ways but the impoverished representational basis is a serious drawback from a linguistic point of ...
COGPARSE: Brain-Inspired Knowledge-Driven Full Semantics Parsing - Radical Construction Grammar, Categories, Knowledge-Based Parsing & Representation
Humans use semantics during parsing; so should computers. In contrast to phrase structure-based parsers, COGPARSE seeks to determine which meaning-bearing components are present in a text, using world knowledge and lexical semantics for construction grammar form selection, syntactic overlap processing, disambiguation, and confidence calculation. In a brain-inspired way, COGPARSE aligns parsing ...
Things between Lexicon and Grammar
A number of grammar formalisms were proposed in the 1980s, such as Lexical Functional Grammars, Generalized Phrase Structure Grammars, and Tree Adjoining Grammars. Those formalisms then started to put stress on the lexicon, and were called lexicalist (or lexicalized) grammars. Representative examples of lexicalist grammars were Head-driven Phrase Structure Grammars (HPSG) and Lexicalized Tree Adjoi...
The Effect of Morphology on Persian Dependency Parsing
Data-driven systems can be adapted to different languages and domains easily. Applying this trend to dependency parsing led to the introduction of data-driven approaches. The existence of appropriate corpora that contain sentences and their associated dependency trees is the only prerequisite of data-driven approaches. Despite obtaining highly accurate results for the dependency parsing task in English langu...
Publication date: 1988